Analysis of similarity/dissimilarity of DNA sequences based on adjacent nucleotide pair representation
نویسندگان
چکیده
Introduction of graphic representation for nucleotide or protein sequences can provide intuitive overall pictures as well as useful insights for performing large-scale similarity analysis. In this paper, we are analyzing the similarity/dissimilarity of the mitochondrial genome sequences from twenty four mammal species. The analysis is important in finding the relatedness among the species and eventually finding the evolutionary relationship. The evolutionary tree or phylogenetic tree is constructed by Unweighted Pair Group Method with Arithmetic Mean (UPGMA). The graphical representation of DNA sequence using Adjacent Nucleotide Pair has been constructed. The distance matrix required for the construction of phylogenetic tree has been built by applying the biological geometry method. Keywords—Graphical representation; similarity analysis; sequence comparison
منابع مشابه
An Evolutionary and Phylogenetic Study of the BMP15 Gene
DNA sequence data contains a wealth of biologically useful information. Recent innovations in DNA sequencing technology have greatly increased our capacity to determine massive amounts of nucleotide sequences. These sequences can be used to specify the characteristics of different regions, interpret the evolutionary relationships between categorized groups, likelihood of performing multiple com...
متن کاملIntraspecies Gene Variation within Putative Epitopes of Immunodominant Protein P48 of Mycoplasma agalactiae
P48 protein of Mycoplasma agalactiae is used to diagnose infection and was identified as potential vaccine candidate. According to the genetic nature of mycoplasma and variable sensitivity in P48-based serological diagnosis tests, intra species variation of P48 nucleotide sequence investigated in 13 field isolates of difference province of Iran along with three vaccine strains. Samples were col...
متن کاملThe similarity/dissimilarity analysis of protein sequence based on nucleotide triplet codon
Based on nucleotide triplet codon, a graphical representation of protein sequences is outlined. A numerical characterization including the location, number and distribution information of all the 20 kinds of amino acids is proposed. The similarity/dissimilarity analysis of ND5 protein sequences of nine species is done, and our approach is compared to other approaches recently proposed based on ...
متن کاملPhylogenetic and sequence analysis of the growth hormone gene of two sturgeons, Huso huso and Acipenser Gueldenstaedtii
In this study, the cDNA Growth Hormone (cGH) of the Belugasturgeon (Husohuso) and Russian sturgeon (Acipensergueldenstaedtii) were cloned and sequenced, and phylogenetic relationships were examined using nucleic acid and amino acid sequences. The nucleotide sequence of the Beluga GH has an open reading frame of 645 nucleotides encoding a protein 214 amino acid residues. The signal peptide cleav...
متن کاملA Novel Graphical and Numerical Representation for Analyzing DNA Sequences Based on Codons
One important task in the study of genome sequences and mutations is to determine densities of specific nucleotides and codons. The graphical representation of DNA sequences provide a simple way of viewing, storing, and comparing various sequences. In this paper, we first present for each kind of codon, a numerically representation as a 2D coordinate (x,y) and give a 2D graphical representation...
متن کامل